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Abstract 

We study some properties of the Cayley graph of the R. Thompson’s group F in 
generators xq, x\ . We show that the density of this graph, that is, the least upper bound 
of the average vertex degree of its finite subgraphs is at least 3. It is known that a 2- 
generated group is not amenable if and only if the density of the corresponding Cayley 
graph is strictly less than 4. It is well known this is also equivalent to the existence of 
a doubling function on the Cayley graph. This means there exists a mapping from the 
set of vertices into itself such that for some constant K > 0, each vertex moves into 
the distance at most K and each vertex has at least two preimages. We show that the 
density of the Cayley graph of a 2-generated graph does not exceed 3 if and only if the 
group satisfies the same condition with K = 1. Besides, we give a very easy formula to 
find the length (norm) of a given element of F in generators xo, x\. This simplifies the 
algorithm by Fordham. The length formula may be useful to find the general growth 
function of F in generators xo, xi and the growth rate of this function. In this paper 
we show that the lower bound for the growth rate of F is (3 + p5)/2. 


Introduction 

The Richard Thompson group F can be defined by the following infinite group presentation 

(x 0 ,xi,x 2 ,... | XjXi = XiX j+ 1 (i < j) ). (1) 

*This research is partially supported by the RFFI grant 99-01-00894 and the INTAS grant 99-1224. 
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This group was found by Richard J. Thompson in the 60s. We refer to the survey [6!] for 
details. (See also 0SI-) It is easy to see that for any n > 2, one has x n = x 0 (n ^xyx^ 1 
so the group is generated by xo, x\. It can be given by the following presentation with two 
defining relations 


, x 0 , Xi I x , xl° = x^ 1 ), (2) 

where a b = b~ l ab by definition. 

Each element of F can be uniquely represented by a normal form , that is, an expression 
of the form 

rv* , ry* . ... Of * . q* ^ . no ^ no \ 

lL j2 1 ’ V' J / 

where s, t > 0, 0 < A < i 2 < • • • < i s , 0 < j\ < j 2 < • • • < jt and the following is true: if (||) 
contains both x* and xf 1 for some i > 0, then it also contains x i+ i or xf+± (in particular, 
~f~ jt)- 

Equivalent dehnition of F can be done in the following way. Let us consider all strictly 
increasing continuous piecewise-linear functions from the closed unit interval onto itself. Take 
only those of them that are differentiable except at hnitely many dyadic rational numbers 
and such that all slopes (derivatives) are integer powers of 2. These functions form a group 
under composition. This group is isomorphic to F. Another useful representation of F 
by piecewise-linear functions can be obtained if we replace [0,1] by [0, oo) in the previous 
definition and impose the restriction that all functions on infinity have the form t t + c, 
where c is an integer. 

The group F has no free subgroups of rank >1. It is known that F is not elementary 
amenable (EA). However, the famous problem about amenability of F is still open. If F is 
amenable, then it is an example of a hnitely presented amenable group, which is not EA. 
If it is not amenable, then this gives an example of a hnitely presented group, which is 
not amenable and has no free subgroups of rank > 1. Note that the hrst example of a non- 


amenablc group without free non-abelian subgroups has been constructed by Ol’shanskii [^2 
(The question about such groups was formulated in [h|], it is also often attributed to von 
Neumann [^L]].) Adian |1|] proved that free Burnside groups with m > 1 generators of odd 


exponent n > 665 are not amenable. The hrst example of a hnitely presented non-amenable 
group without free non-abelian subgroups has been recently constructed by Ol’shanskii and 


Sapir [^]. Grigorchuk [|I3j constructed the hrst example of a hnitely presented amenable 
group not in EA. 


The author thanks Matt Brin and Goulnara Arjantseva for helpful remarks. 


1 Density 

By the density of a Unite graph T we mean the average value of the degree of a vertex in 
T. More precisely, let V \, vy be all of vertices of T. Let deg r (u) denote the degree of a 
vertex v in the graph T, that is, the number of oriented edges of T that come out of v. Then 

wrn deg r (ui) + • • • + deg r (u fc ) 

h(T) =- - - (4) 
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is the density of T. 

Let G be a group generated by a finite set A. Let C(G,A) be the corresponding (right) 
Cayley graph. Recall that the set of vertices of this graph is G and the set of edges is 
G x A ±l . For an edge e = ( g,a ), its initial vertex is g, its terminal vertex is ga, and the 
inverse edge is e _1 = (ga, a -1 ). The label of e equals a by definition. For the Cayley graph 
C = C(G, A) we define the number 


5(C) = sup 5(T), (5) 

r 

where F runs over all finite subgraphs of C — C(G,A ). So this number is the least upper 
bound of densities of all finite subgraphs of C. If C is finite, then it is obvious that 5(C) = 
5(C). So we may call 5(C) the density of the Cayley graph C. 


Recall that a group G is called amenable whenever there exists a finitely additive nor¬ 
malized invariant mean on G, that is, a mapping p:V(G) —> [0,1] such that p(A U B) — 
g(A) + g(B) for any disjoint subsets A,BCG, p(G) = 1, and g(Ag) = p(gA) = g(A) 
for any A C G, g G G. One gets an equivalent definition of amenability if only one-sided 
invariance of the mean is assumed, say, the condition p(Ag) = g(A) (A C G, g G G). The 
proof can be found in Tjj . 

The class of amenable groups includes all finite and abelian groups. It is invariant under 
taking subgroups, quotient groups, group extensions, and ascending unions of groups. The 
closure of the class of finite and abelian groups under these operations is the class EA of 
elementary amenable groups. A free group of rank > 1 is not amenable. There are many 
useful criteria for (non)amenability [[TT] , [20| , [L4|. We need to mention the two properties of a 
finitely generated group G that are equivalent to non-amenability. 


NAi. If G is generated by m elements and C is the corresponding Cayley graph, then 
the density of C has no maximum value, that is, 5(G) < 2m. 

Note that if NAi holds for at least one finite generating set, then the group is not amenable 
and so the same property holds for any finite generating set. For the proof of this property, 
we need to use the well-known Fglner condition 0- For our reasons it is convenient to 
formulate this condition as follows. 

Let G be the Cayley graph of a group. By dist(w, v) we denote the distance between two 
vertices in G, that is, the length of a shortest path in G that connects vertices u, v. For any 
vertex v and a number r let B r (v) denote the ball of radius r around v, that is, the set of 
all vertices in G on the distance at most r from v. For any set Y of vertices, by B r (Y ) we 
denote the r-neighbourhood of Y , that is, the union of all balls B r (v), where v runs over Y. 
By BY we denote the boundary of Y , that is, the set B\(Y) \ Y. The Fqlner condition (for 
the case of a finitely generated group) says that G is amenable whenever inf ffdY/ffY = 0, 
where the infimum is taken over all non-empty finite subsets in G in a Cayley graph of 
G in finite number of generators (this property does not depend on the choice of a finite 
generating set). Any finite set Y of vertices in G defines a finite subgraph (also denoted by 
Y). The degree of any vertex v in G equals 2m, where m is the number of generators. We 
know that exactly deg y (u) of the 2m edges that come out of v , connect the vertex v to a 
vertex from Y. The other 2m — degy(u) edges connect v to a vertex from dY. Note that 
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each vertex of dY is connected by an edge to at least one vertex in Y. This implies that the 
cardinality of dY does not exceed the sum £](2 m — deg Y (n)) over all vertices of Y. Dividing 
by ffY (the number of vertices in Y) implies the inequality #< dY/ffY < 2m — 5(Y). If 
5(C) = 2m, then Y can be chosen such that 5(Y) is arbitrarily close to 2m so ffdY/ffY will 
be arbitrarily close to 0. On the other hand, for any vertex v in Y there are at most 2m 
edges that connect v to a vertex in Y. Therefore, the sum ~ dy(v)) does not exceed 

2mjpdY. So 2m — 5(Y) < 2mffdY/ffY. If the right hand side can be done arbitrarily close 
to 0, then 5(Y) approaches 2m so 5(C) = 2m. 


NA 2 . If C is the Cayley graph of G in a Unite set of generators, then there exists a 
function 0: G —> G such that a) for all g G G the distance dist (g,<p(g)) is hounded by a 
constant K >0, b) any element g G G has at least two preimages under 0. 


An elegant proof of this criterion based on the Hall-Rado theorem can be found in [0, 
see also m • Note that this property also does not depend on the choice of a finite generated 
set. A function 0 from NA 2 will be called a doubling function on the Cayley graph C. 

We need a definition. Suppose that NA 2 holds for the Cayley graph of a group G for the 
case K = 1. Then we say that the Cayley graph C is strongly non-amenable. The function 
(j)\G —> G will be called a strong doubling function on the Cayley graph C. Note that each 
vertex is either invariant under 0 or it maps into a neighbour vertex. We know that NA 2 
holds if and only if the group is not amenable, that is, 5(C) < 2m. Now we would like to 
find out what happens if the Cayley graph of a 2-generated group is strongly non-amenable. 


Theorem 1 The Cayley graph of a group with two generators is strongly non-amenable 
if and only if the density of this graph does not exceed 3. 


Proof. Let G be a group with 2 generators and let C be the corresponding Cayley graph. 
It follows from the proof of ['7|, Theorem 32] that C is strongly non-amenable if and only if 
the doubling inequality holds, that is, ffBi(Y) > 2ffY for any finite set Y of vertices in C. 
Indeed, if C admits a doubling function 0 from NA 2 with K = 1, then 0~ 1 (T) has at least 
2 ffY elements and it is contained in B\(Y). To prove the converse, one needs to consider a 
bipartite graph with two classes of vertices both equal to G. Two vertices from the different 
classes are connected by an edge whenever the distance between them in C does not exceed 
K — 1. The doubling inequality implies that the conditions of the Hall-Rado theorem hold. 
Therefore, the bipartite graph has a perfect (2, 1)-matching. This means that C admits a 
doubling function with K — 1. 

The “only if’ part is trivial since the doubling inequality is equivalent to the fact that 
pfdY > jfY for any finite subset Y in C. We know that ffdY/ffY < 2m — 5(Y) for any 
non-empty finite subset Y C C, where m = 2. This means that 8(Y) < 2m — 1 = 3 and so 
5(C) < 3. 

Now suppose that 5(C) < 3. Let Y be a finite subgraph of C with k vertices. For 
each 0 < s < 4, let q s be the number of vertices in Y that have degree s in Y. Clearly, 
k = qo + Qi + Q 2 + Q 3 + Qi- The number of oriented edges in Y is the sum of all degrees of 
vertices, that is, qi + 2 q 2 + 3 q 3 + 4g 4 . If we divide this number by k, then we get 5(Y). Since 
5(C) < 3, we have 5(Y) < 3, which is equivalent to the inequality q\ + 2 q 2 + 3^3 + 4g 4 < 
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3 k = 3 (q 0 + qi + Q 2 + <?3 + ( U) and so it can be rewritten as 3g 0 + 2q { + q 2 — <?4 > 0. For any 
finite subgraph Y, we denote the number 3g 0 + 2qi + q 2 — q± by q(Y). 

Suppose that the doubling inequality does not hold. Let us choose a minimal counterex¬ 
ample Y to the doubling inequality, that is, a finite subgraph Y in C with the property 
ffB\{Y) < 2#H for which the number q(Y) > 0 takes the smallest possible value. Each 
vertex in dY = Bi(Y) \ Y is connected by an edge with at least one vertex in Y. Suppose 
that some vertex v in dY is connected with at least two vertices in Y. Let Y’ be the subgraph 
in C with the new vertex v added to Y and two new non-oriented edges that connect v with 
vertices V\ , v 2 in Y. By definition, Lfi( Y') = Bi(Y) U Bi(v). The vertex v has exactly 4 
edges that come out of v. At least two of them connect v with a vertex in Y. So Bi(v) may 
contain at most 2 vertices not in Bi(Y) (the vertex v itself belongs to Bi(Y)). This means 
that #Bi(Y') < #Bi(Y) + 2 < 2 ffY + 2 = 2 jfY'. Hence Y' is also a counterexample. To 
complete the proof, we need to check that q(Y') < q(Y). 

When we add a non-oriented edge that connects v and v 3 (j = 1,2), then the degree of 
Vj increases by 1. This means that if deg Y (vj) = i, then deg Y ,(vj) — i + 1. So 0 < i < 4, the 
value of qi decreases by 1, the value of q l+ \ increases by 1. Clearly, q(Y) — 3qo + 2qi + q 2 — q± 
decreases by 1. So if we add the edges for both V\ and v 2 (this does not exclude the case 
v i — v 2 ), then q(Y) decreases by 2. But we also have a new vertex v that has degree 2 in 
Y'. Thus q 2 increases by 1 and so after all these operations we have q(Y') = q(Y) — 1. This 
contradicts the minimality of Y. 

The proof is complete. 

Note that the proof of Theorem |T] goes without any changes if we apply it to any regular 
graph of degree 4. Also we have to mention that the density of a Cayley graph of a group is 
closely related to an isoperimetric constant t* of a graph (see the definition in [|j7j). Namely, 
one has the equality t*(C) + 5(C) = 2 m for the Cayley graph C of an m-generated group. 

Theorem [3] applied to the Cayley graph C 2 of F in generators x 0 , X\ means that if we 
cannot find a subgraph in C 2 with density greater than 3, then there exists a doubling function 
on C 2 . One can imagine this doubling function in the following way. Suppose that a bug 
lives in each vertex of C 2 . We allow these bugs to jump at the same time such that each bug 
either returns to its initial position or it jumps to a neighbour vertex. As a result, we must 
have at least two bugs in each vertex. 

It is natural to ask how much the value of 5(H) can be for the finite subgraphs we are 
able to construct. It is easy to see that in each finite subgraph Y of C 2 there exists a vertex 
of degree at most 2. Indeed, if all vertices were of degree 3 or 4, then one could travel along 
Y by positively labelled edges only (if we enter a vertex by an edge labelled by x 0 or x l7 
then we can leave this vertex travelling along an edge with one of these labels). But F does 
not have nontrivial relations that involve positive letters only. 

However, the following result shows that we are able to construct finite subgraphs in C 2 
with the density arbitrarily close to 3. 

Theorem 2 Let C 2 be the Cayley graph of F in generators x Q , x, \. For any integers 
m,n > 1 there exists a finite subgraph T ntTn in C 2 such that 

lim 5(f n m ) = ^-jp (6) 

m^oo 2n — 1 
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for any n > 2. In particular, S(C‘ 2 ) > 3. 

Proof. It is convenient to work with subgraphs in C = Coo. Note that all Cayley graphs 
Ci (?' = 2,3,...) are subgraphs in C. Let Y be a finite connected subgraph in C. Assume 
that Y is full, that is, if two vertices of Y can be connected in C by an edge, then this edge 
belongs to Y. It is not hard to see that any finite subset in F determines the corresponding 
full subgraph, which will be finite. 

Let Y be a finite connected full subgraph in C. By a rank of a vertex v in Y we mean 
the maximum number k > 0 such that v has an edge in Y labelled by xf 1 that comes out of 
v. Suppose that for some i > 0, the rank of any vertex in Y is greater than i. Under these 
conditions, we define a subgraph in C denoted by Ai(Y). 

Let vi, ..., Vk be the list of all vertices in Y (they are elements of F). Let rq, ..., ry 
be the ranks of these vertices (each of these numbers exceeds i). For any 1 < j < k, we 
consider the set of vertices of the form v 3 xf s , where 0 < s < Tj —i. We call this set a column 
of Vj. The number of vertices in this column equals r 3 —i (if r — j = i + 1, then the column 
consists of Vj only). The vertices in a column are connected in C by edges with label x t . The 
number of these edges in the column of v 3 equals r 3 — i — 1. Note that all the columns are 
disjoint. The union of them will be the set of vertices of the graph Ai(Y) we want to define. 

Now let us take an arbitrary edge from Y. Suppose that it connects v' and v". Let r', 
r" be the ranks of these vertices, respectively. Without loss of generality we may assume 
that our edge has label Xk, where k > i so we have v r Xk = v". If k — i + 1, then we do 
nothing with this edge. If k — i + 2, then v'x~ ] and v" xf 1 both appear in the columns of 
v' and v". In the graph C , these vertices are connected by an edge labelled by x i+ i since 
v'xf l x i+ i = v'x i+ 2 Xf l = v"xf l . In general, if k — i + p + 1, where p > 1, then the rank of 
each of v', v" is at least i + p + 1. So the elements v'xf*, v"xf' will appear in the columns for 
all 1 < t < p. From the defining relations of F it follows that v'xf *, v"x~' are connected in C 
by an edge with label Xk~t • Indeed, v'x^Xk-t = v'xkxf 1 = v^xf 1 (we used the fact k — t > i 
and the relations of the form xf 1 Xj = x 3+ pxf 1 that hold in F for all j > i). 

So for any edge e labelled by Xk from v' to v" in Y (k > i + 1), we have k — i edges 
(including e) that connect a vertex in the column of v' with a vertex in the column of v". 
These edges are called parallel to e. The union of all these edges over all edges e in Y forms 
the set of edges of Ai(Y). 

So we have defined the new graph Ai(Y). It is finite, connected, and it contains Y. Let us 
check that Ai(Y) is full. Suppose that two vertices v'xf s and F'xf 1 of Ai(Y) are connected 
by an edge / with label Xk■ We have inequalities 0 < s < r' — i, 0 < t < r" — i, where r', r" 
are the ranks of v ', v", respectively. The graph Y is connected so there exists a path from 
v' to v" in Y. The label of this path is a word W that involves only letters of the form xf 1 , 
where j > i. Equalities v'W = v" and v'xf s Xk = v"xf f imply xfW = Xkx\. We consider 
several cases. 

1) k < i. In this case Xk belongs to the subgroup of F generated by Xk+ 1 , Xk+ 2 , ..., 
which is impossible. 

2) k = i. In this case xl~ s+1 = W belongs to the subgroup generated by ay+i, 2 ^+ 2 , • • • • 
This can happen only if the exponent t — s+1 equals 0 and W = 1 in F. So v' = v" = v and 
our vertices have the form vxf s , vxf^ s+1 \ These vertices are connected by the edge with 
label Xi in ^4j(T'). 
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3 ) k > i. The word W~ 1 xJ s XkX t i equals 1 in F so the algebraic exponent sum over all 
letters with the smallest subscript must be zero. In our case, the smallest subscript is i so 
s = t. Therefore, W = x^ s XkX\ = Xk+ S in F. The graph Y is full so the edge e labelled 
by Xk+s that connects vertices v\ v" must belong to Y. Then our edge / labelled by Xk is 
parallel to e and thus belongs to Ai(Y) by definition. 

Note that the rank of each vertex in AiiY) is at least i. This means that we can apply 
the operator A[ to Ai(Y ) for any 0 < i' < i. We start with the “linear” graph m that 
consists of m + 1 vertices 1, x~ x , ..., x~ m and m positive edges labelled by x n that connect 
these vertices. It is obvious that S njm is full and all vertex ranks are equal to n. We can 
apply the operator A n -2 to it ( A n -i can be also applied but this is useless). After that, we 
can apply A n - 3 , ..., A±, Aq. As a result we get the family of subgraphs 


T 


n,m 


AqA\ ■ ■ ■ A n -2^ 


n,m* 


(7) 


Each of them is a subgraph in C^. If we erase all edges of T n m that have labels of the form 
x^ 1 , where k > 1, then we get the subgraph T n rn in C 2 from the statement of our Theorem. 

It is possible to characterize all vertices of T n m as some words with negative expo¬ 
nents. We only mention that all of them will be of the form x~ Sn x n %~ 2 ■ ■ -Xi Si Xq S °, where 


so, Si,..., s n — 2 , s n ^ 0. 


Let Z be a subgraph in C^. By a star of a vertex v in Z (or a Z-star) we mean the set of 
labels of all edges that come out of v and belong to Z. Suppose that Z', Z" are two subsets 
of the set of vertices of Z and A: Z' —> Z" is a bijection. If the Z-star of each vertex v G Z' 
coincides with the Z-star of its image A(r>), then we say that A is a local isomorphism (in 
Z). Now let Y be a finite connected full subgraph in such that all its vertices have rank 
> i. For any subset Y' of Y we can denote by AiiY') the union of columns of all vertices 
from Y'. (Note that each full subgraph is determined uniquely by the set of its vertices 
so this notation does not lead to a confusion. For a single vertex v we shall write Ai(v) 
instead of Zlj({u}).) It is worth noting that if two vertices v,w eY have the same F-star, 
then they have the same rank r and there exists a natural bijection between their columns, 
vx~ k 1 —► wx^ k , where 0 < k < r. Obviously, the A;(F)-stars of vx^ k and wx^ k are also the 
same. So we can conclude that if Y' and Y" are disjoint subsets and A is a local isomorphism 
between Y' and Y" in Y, then the sets Ai(Y r ) and AiiY") will be also disjoint and there 
exists a local isomorphism between them in A t . 

Now we can consider two vertices x~ s , x~ f in S„ jm , where 0 < s < t < m. Obviously, the 
sets { 2 ^} and {x~ f } arc locally isomorphic disjoint subsets in H„ )m . Applying the argument 
from the above paragraph, we see that the sets V s and V) are disjoint locally isomorphic 
subsets in T n)m , where 14 = AqA\ ■ ■ ■ ^4 n _ 2 (S n)m ) by definition for any 0 < k < m. It is clear 
that all the sets Vo, Vi, • • •, V m form the disjoint subdivision of the set of vertices of T nim . 
These sets have exactly the same number of vertices. Let pk (0 < k < m) be the average 
degree of a vertex in the subset 14 , that is, the number J2 v &v k deg(u)/#I4 (the degree is 
taken in the whole Cayley graph). We have p\ — ■ ■ ■ — p m -1 = P- The density of T„ )m will 
be equal to 


5(r n , m ) = 


P0 + Pi + ‘ ‘ ' + Pm -1 + Pn 


( 8 ) 


m + 1 

Since po and p m are bounded, the limit of (||) as m approaches 00 , is exactly p. Thus to 
complete the proof, we need to show that p = 6(n — 1)/(2 n — 1). 
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To calculate p. it is convenient to introduce the new family T n of auxiliary finite graphs 
(they are no longer subgraphs of Coo). But they will be still labelled graphs (automata) and 
it will be clear from the definition of them that the set of vertices of T n is in a Injection with 
the set of vertices of 14 for any 1 < k < m. Moreover, each vertex in T„ have the same star 
as the corresponding vertex in 14. Thus p will be equal to <5(T n ). The idea is to extend the 
notion of A%. Let Y be any labelled finite graph, where all edges have labels of the form 
x^ 1 , j > 0. The rank of a vertex is defined in the same way. Suppose that all vertices in Y 
have rank > i. For each vertex v of rank r we consider the set of r — i vertices u 0 , iy, ..., 
where u 0 = v. These vertices form the column of v. We connect them by r — i — 1 
directed edges labelled by ay, where the kth edge (1 < k < r — i) goes from 04 . to 1 - (It is 
easy to see that 04 here is an analog of vx~ k in the above construction.) Then for any edge 
e of Y that has label Xj and goes from v to w, we consider j — i edges eo, ei, ..., e 3 -i-\. 
Namely, for any 0 < k < j — i, we connect 14 and uy by an edge labelled by Xj-k- Clearly, 
eo = e. So we get a graph that contains Y and we denote it by Ai(Y), as above. It is easy 
to see that we have an extension of the above concept. Also it is clear that we can repeat 
applications of the operators A, with decreasing subscripts. If we start with the graph S n 
that has a single vertex and a loop labelled by x n at this vertex, then we get the graph 

1-4 MqMi ' ' ' A n — 2 — ‘n- 

By f n we denote the graph obtain from T„ by erasing all edges labelled by xf 1 , where i > 2. 
Since the vertex of S n has the same star as the vertex x~ k in S njm for any 1 < k < m, we 
easily conclude from our definitions that d(f n ) = Pk = P- 

The graphs T n (n > 1) can be easily drawn explicitly. If a vertex v has a loop at v 
labelled by x m , then it will be clear that the rank of v equals m. So we can draw this 
vertex as a circle with the number m inside. If Y is a labelled graph with labels of the 
form x 3 (j > 0), then by T {Y) we denote the graph obtained from Y by increasing all 
subscripts of the labels by 1. We know that Tx is a single loop labelled by ay. To obtain 
T n+ i from T n (n > 1), one has to apply T to T n and then apply Aq. This is true because 
r n+ l Ao(A\ ' ' ' A n — l^n+l ) = A(^(-4o • • • A n - 2 ^n-i)) = Mo(^(T n )). This gives an easy 
way to imagine how these graphs look like. We illustrate this process by the following picture 
that shows T n for 1 < n < 4. 







Xi 



Xi 


@^HD 


r 4 r 2 r 3 


r 4 


Lemma 1 Let a n k (1 < k < n) be the number of vertices ofY n that have rank k. Then 

k(2n — k — 1)! 




(n — k) \ n\ 

The total number of vertices in T n equals the nth Catalan number, that is, 


(9) 


(2m)! 

n\ (n + 1)! 


Proof. We proceed by induction on n. Obviously, an = 1, which agrees with the formula. 
Each vertex of rank k in T n , where 1 < k < n, becomes a vertex of rank k + 1 in T(r n ). After 
applying Aq to T (T n ), the column of this vertex will contain exactly one vertex of each rank 
from 1 to k+1. Then the number of vertices of rank i in T n+ i = ^(^(T^)) will be equal to the 
sum ][] i<fc+1 a n k■ Therefore we have the recursive formulas a n+ i,i = a n+ i ,2 = a n i + • ■ • + a nn , 
®n+ 1,3 ®n 2 T ■ ■ ■ T a nn , • • •, a n _|_i,n 1 T a nn , a n+ i,n+i (inn- By the inductive 

assumption, a n+ i,n+i = = 1. Suppose that we have already proved formula (|9|) for 

a nj fc + i, where 2 < k < n. Then 


Q>n-\-l,k 


(k — l)(2n — k)\ t (k + l)(2n — k)\ 

dri,k—l + ®re+l,fe+l / 7 - -i \ i I ( t \ | / iiU 

(n — k + 1)\ n\ (n — k) \ [n + 1)! 

(2 n — k)\ ( k — 1 fc + l\ k(2n — fc + 1)! 

(n — k)\n\ \n — k + 1 n + lj (n — k + 1)! (n + 1)!’ 
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which proves (0) for a n+ i^. Finally, 


2(2n — 1)! _ (2n)! 

° n+ i’i Qn+1,2 (n — 1)! (n + 1)! n! (n + 1)! ’ 
which proves 0) for a n+ ij. 

Note that the total number of vertices in T n will be equal to a n ± + • • • + a nn = a n+ i t i = 
(2 n)\/ ( n\ (n + 1)!), which is the nth Catalan number. This completes the proof of the Lemma. 


Now let b nk will be the number of edges in T k labelled by x k , where 0 < k < n. (Recall 
that if we denote a vertex by a circle with the number m inside, then this vertex has a loop 
labelled by x m .) 


Lemma 2 For any n > 2, 


b-nO b n | 


3(2n - 2)! 


(n — 2)! (n + 1)!' 


Proof. (Remark that b\o = 0, bn = 1.) We would like to establish a more general formula 


bnk 


(fc + l)(2n — k-2)\ 
(n — k) \ (■n + 1)! 


■ (3n 2 — 3 n(k + 1) + k 2 + 2k) 


( 10 ) 


for any 0 < k < n together with b nn = 1. The conclusion of the Lemma is a partial case 
of these. We proceed by induction on n. The case n = 1 is obvious. First of all, we 
want to give recursive formulas for b n+ \^ (0 < i < n + 1). Let v be a vertex of rank k in 
r n (1 < k < n). It becomes a vertex of rank k + 1 in T(r n ) and thus creates exactly k 

edges labelled by x 0 in ^(^/(r^)) = r n+ i. This implies fe n +i,o = a n i + 2 a n2 + -h na nn . 

Using the recursive formulas from the proof of Lemma [lj, one can rewrite this as fe n +i,o = 

(Onl + ' ' ' + CLnn ) + (On2 + ' ' ' + ®im) + ' ' ' + CL nn = a n+ 1,2 + On+1,3 + ' ' ' + On+l,n+l = «n+2,3- 

Therefore, b n+ i j0 = 3(2n)!/((n — 3)!n!), which proves (p~0| ) for & n+ i i0 - 

Let e be an edge labelled by Xk in r„ (0 < k < n). It becomes and edge labelled by Xk+i 
in tf(r n ) so it creates one edge of each of the ranks from 1 to k + 1 in ^(T (r n )) — r n+ ,. 
Thus b n+ i } i will be the sum J2i<k+ib n k for any 1 < i < n + 1. We have the recursive 


formulas b n+ iq = b n0 + 


+ b nn , fen+1,2 — b n 1 + 


+ b n 


j ^n+l,n+l 


= b nn . Therefore, 


b n+ i tU+ i = b nn = 1 by the inductive assumption. Suppose that ( |T0| ) has been proved for 


^n+i,fc+i, where 1 < k < n. We have b n+ = &n,fc-i + &n+i,*;+i- Hence, using the assumptions, 
we obtain that b n+1)k equals 


k(2n — k — 1)! 

(n — k + 1)! (n + 1)! 


. ( 3n 2 _ 3nk + k 2 _ 1} + (fc + 2)(2n k 1)! (3n2 _ 3nk + k 2 + k) 

(■n — k)\ (n + 2)1 


(2 n — k — 1)! f k(3n 2 — 3 nk + k 2 — 1) (k + 2) (3 n 2 — 3 nk + k 2 + k)' 


(n — k) \ (n + 1)! 


n — k + 1 


n + 2 


(.k + l)(2n — A;)! 
(n — k + 1)! (n + 2)! 


■ (3n 2 — 3 nk + 3n + k 2 — k), 


10 













which proves (jTtJ) for b n+ i k . 

The proof is complete. 

Finally, to compute <5(f n ), one has to divide the number of oriented edges in f n labelled 
by Xg" 1 , xf 1 by the number of vertices in T n (it is the same as the number of vertices in f ra ). 
According to Lemmas [3] and |]. we have 

Mm \ _ 2(& n0 + b n i) _ 12(2 tt. - 2)! _ 77 ! (n + 1)! _ 6(77 - 1) 

a n+ 14 (77 — 2)! (77 + 1)! ( 277 )! 2t7 — 1 

for any 77 > 2. 

This completes the proof. It may be interesting to know how many vertices in f n have a 
given degree. For any n > 5, one has 

3(277-4)! _ 4(5n- 12)(277 - 5)! _ 6(277 - 5)! 

2 (77 — 2)! (77 — 1)! ’ 3 (77 —3)! 77 ! ’ 4 (77 — 5)! (77 + 1)! 

Here denotes the number of vertices that have degree d. This implies that an average 
vertex of f min has degree 2, 3, 4 with probabilities approaching 3/16, 5/8, 3/16, respectively, 
as ??7, 77 approach infinity. 

2 The Lower Bound for the Growth Rate 

By a norm of an element of F we mean the shortest length of a word in generators {xj/ 1 , xf 1 } 
that represents this element. Equivalently, the norm of an element is the distance from this 
element to the identity in the Cayley graph C 2 of the group F. 

Let b n (77 > 0) be the number of elements in the ball around 1 in C 2 of radius n. Since F 
has a free subsemigroup generated by x 0 , xi, the group F has exponential growth. But it is 
still unknown what will be the general growth function 

OO 

G(t ) = b n = &o + b\t + & 2 f 2 + • • • + b n t n + • • • 

n=0 

of the group F in canonical generators Xo, x±. Since the sequence b n is monotone and satisfy 
the submultiplicative inequality b m+n < b m b n ( 777,77 > 0), there exists the limit 

b = lim \[b 

n —>00 v 

which is called the growth rate of a group (in a given set of generators). The exact value 
of the growth rate of F in generators Xo, X\ is also unknown. Recently Burillo [j5|] found an 
exact number of positive elements in the ball of radius n (an element of F is positive if it 
is a product of positive letters x 0 , Xi, x 2 , .. .of the infinite set of generators). He also found 
the general growth function for positive elements and its growth rate. This implies the lower 
bound for the growth rate of F. Namely, Burillo proved that b is not less than the largest 
root of the equation x 3 — x 2 — 2x + 1 = 0, that is, b > 2.2469796.... We would like to 
present a better lower bound for the growth rate of F. 


11 









Theorem 3 The growth rate of the group F in generators Xq, x\ is not less than (3 + 
V5)/2 = 2.6180339.... 


Proof. We use the normal form for the elements of F found in 17]]. It was shown 
that there exists a regular spanning tree of the Cayley graph C 2 . Each element in F can be 
uniquely represented by a word of the form w(x o, aq), where w does not contain the following 
forbidden subwords: x^xf 1 (i = 0,1), x^XqXi, x^x^^xf 1 (n > 1). This normal form of 
an element does not give a minimal representative of an element in the sense of its norm 
(that is, the regular tree is not geodesic). However, for the regular language L of normal 
forms one can find the growth rate using standard methods. If a word of length n belongs 
to this language, then the norm of the corresponding element of F does not exceed n and 
so it will belong to B n ( 1). Thus b n , the number of elements in B n ( 1), will be not less than 
the number of words in L that have length n. So we begin to calculate the growth function 
of the regular language L. To draw its generating automaton, we need to subdivide L into 
seven disjoint subsets. 


1) The empty word. 

2) The words of the form Xq, where k > 1. 

3) The words that end with the letter xf 1 . 

4) The words that end with the letter X\. 

5) The words that end with the letter xf l . 

6 ) The words that end with x\Xq or x^xq. 

7) The words that end with xpx q or xf 1 ^. 

It is obvious that all the seven classes are pair-wise disjoint and their union is L. This 
allows to draw the generating automaton of L: 


X\ 



Now let A be the transition matrix of the automaton, that is, aij = 1 if there is a directed 
edge from the 7th to the jth vertex of the automaton and a t] = 0 otherwise. Obviously, 
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A = 


( 0 
0 
0 
0 
0 
0 

V 0 


0 \ 
0 
0 
0 
0 
1 

!/ 


Let c<?> be the number of directed paths of length n in the automaton from the 1st vertex 
to the vertex number p [n > 0, 1 < p < 7). Clearly, = 1, Cq P ^ = 0 if 2 < p < 7. For any 
n>l, l<p<7 one has 

c n ^ = c n-i a i,p + c^ 2 2 1 a 2p +-f Cnlia 7p . (11) 

Let C p (t ) be the generating function of c^\ that is, 

OO 

c r (t) = E + eft + ■ ■ • + c$ >s” + • ■ ■ 

n= 0 


for each 1 < p < 7. Obviously, from (|IID we obtain Cj = 1, C p (t) = t X)J=i Ci(£)°V In the 
matrix form, this means that (C'i(i),..., C 7 (t))(E—tA ) = (1,0,..., 0). Thus (C'i(t),..., C 7 (t)) 
is (1,0,..., 0)(E—tA)^ 1 , that is, the first row of (E — tA)~ l . We are interested in the function 
C(t) = C\(t) + • • • + C 7 (t). Finding the inverse matrix to E — tA and adding the elements 
in its first row, gives us 

Ci ' t ' = Jt — l)(t 2 — 3i + 1) 

which is presented by series as 

C(t) = 1 + 4 1 + 12 1 2 + 34t 3 + 92t 4 + 244t 5 + 642t 6 + 1684t 7 + 4412t 8 + 11554t 9 + • • •. (12) 


The coefficient c n on t n shows the number of directed paths of length n in the automaton 
that start in the first vertex. Hence C(t) is the generating function for the regular language 
L. The radius of convergence of the series (|T^[) will be the least absolute value of a root of 
the denominator, that is, it will be (3 — y / 5)/2. So the growth rate of L will be the reciprocal, 
that is, (3 + v / 5)/2. We have already mentioned that X^=o *+ — f° r all n > 0. So we have 
the lower bound for the growth function of F and (3 + v / 5)/2 = 2.6180339. .. will be the 
lower bound for its growth rate. This completes the proof. 

To compare the growth function of L with the growth function of F, let us show the first 
terms of the series for the function G s (t), where the coefficient on t n below shows the number 
s n = b n — 6 n _i, that is, the number of elements in the sphere of radius n in the Cayley graph 
of F. It is easy to see that the growth rates of b n and s n are the same. The coefficients of 
the series below were found on a computer. 

G s (t) = 1 + 4 1 + 12 1 2 + 36t 3 + 108t 4 + 314i 5 + 906t 6 + 2576t 7 + 7280t 8 + 20352t 9 + • • •. 

One can mention that the sequence s n /s n _i is decreasing for 1 < n < 9. Since s 9 /s 8 = 
2.7956043 ..., one can expect that the growth rate of F has this number as an upper bound. 
(Of course, this is only a conjecture.) 
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3 Representing Elements of F by Diagrams 


This background Section is devoted to the description of the representation of elements of 
F by semigroup diagrams. We need this for the result of the next Section. The contents 
of the present Section is essentially known. Detailed information about diagram groups can 
be found in |L8|. However, we need to describe a modified version of this idea based on the 


representation of F by non-spherical diagrams. 

First of all, let us recall the concept of a semigroup diagram and introduce some notation. 
To do this, we consider the following example. Let V = (a, b \ aba = b, bab = a ) be the 
semigroup presentation. It is easy to see by the following algebraic calculation 


a 5 = a(bab)a(bab)a = (aba) (bab) (aba) = bab = a 

that the words a 5 and a are equal modulo V. The same can be seen from the following 
picture 



This is a diagram A over the semigroup presentation V. It is a plane graph with 10 
vertices, 15 (geometric) edges and 6 faces or cells. Each cell corresponds to an elementary 
transformation of a word, that is, a transformation of the form p ■ u ■ q —> p ■ v ■ q, where 
p, q are words (possibly, empty), u = v or v = u belongs to the set of defining relations. 
The diagram A has the leftmost vertex denoted by t(A) and the rightmost vertex denoted 
by t( A). It also has the top path top(A) and the bottom path bot(A) from t(A) to r( A). 
Each cell n of a diagram can be regarded as a diagram itself. The above functions t, r, top, 
bot can be applied to 7r as well. We do not distinguish isotopic diagrams. 

We say that A is a (uq, u; 2 )-diagram whenever the label of its top path is W\ and the 
label of its bottom path is u; 2 . In our example, we deal with an (a 5 , a)-diagram. If we have 
two diagrams such that the bottom path of the first of them has the same label as the top 
path of the second, then we can naturally concatenate these diagrams by identifying the 
bottom path of the first diagram with the top path of the second diagram. The result of the 
concatenation of a (uq, u> 2 )-diagram and a (u> 2 , u^-diagram obviously is a (itq, u^-diagram. 
We use the sign o for the operation of concatenation. For any diagram A over V one can 
consider its mirror image A -1 . A diagram may have dipoles , that is, subdiagrams of the 
form 7r o 7r _1 , where 7r is a single cell. To cancel (or reduce) the dipole means to remove 
the common boundary of 7r and and then to identify top(7r) with bot(7r _1 ). In any 
digaram, we can cancel all its dipoles, step by step. The result does not depend on the 
order of cancellations. A diagram is irreducible whenever it has no dipoles. The operation of 
cancelling dipoles has an inverse operation called the insertion of a dipole. These operations 
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induce an equivalence relation on the set of diagrams (two diagrams are equivalent whenever 
one can go from one of them to the other by a finite sequence of cancclling/inserting dipoles). 
Each equivalence class contains exactly one irreducible diagram. 

For any nonempty word w, the set of all (w, w)-diagrams forms a monoid with the 
identity element s{w) (the diagram with no cells). The operation o naturally induces some 
operation on the set of equivalence classes of diagrams. This operation is called a product and 
equivalent diagram are called equal. (The sign = will be used to denote that two diagrams 
are isotopic.) So the set of all equivalence classes of (w, ru)-diagram forms a group that is 
called the diagram group over V with base w. We denote this group by T>(V,w). We can 
think about this group as of the set of all irreducible (w, w)- diagrams. The group operation 
is the concatenation with cancelling all dipoles in the result. An inverse element of a diagram 
is its mirror image. We also need one more natural operation on the set of diagrams. By the 
sum of two diagrams we mean the diagram obtained by identifying the rightmost vertex of 
the first summand with the leftmost vertex of the second summand. This operation is also 
associative. The sum of diagrams Ai, A 2 is denoted by Ai + A 2 . 

It is known that the group F is the diagram group over the simplest semigroup presenta¬ 
tion V = (x | x 2 = x ) with base x (note that for any base x k , where k > 1, we get the same 
group). It is not hard to compare this representation of F with other known representations. 
Say, in || and many other papers, the elements of F are represented by certain pairs of 
rooted binary trees. If an element g € F is represented by an (irreducible) (x, x)-diagram 
A, then the corresponding pair of trees can be recovered as follows. Let us consider the dual 
graph T of the graph A. The vertices of T are the midpoints of all edges of A. For each 
(x, x 2 )-cell of A we connect the midpoint of the top edge of the cell (labelled by x) with 
the midpoints of the edges e', e", where e'e" is the bottom path of the cell. This gives a 
“wedge” usually called the caret. All these carets form a rooted binary tree. Similarly, we 
do the same thing for all (x 2 , x) -cells of A. This gives the other rooted binary tree. Each 
of the two trees have the same number of leaves, that is, the vertices which are not roots of 
any carets. The ith leaf of the upper tree (if to count the leaves from the left to the right) 
coincides with the Ah leaf of the lower tree. 

Now let us compare the diagram representation of F with the representation of its ele¬ 
ments by piecewise-linear homeomorphisms of the closed unit interval [0,1]. Let A be an 
(x p , x 9 )-diagram over V. We will show how to assign to it a piecewise-linear function from 
[0, p] onto [0, q\. Each positive edge of A is homeomorphic to the unit interval [0,1]. So we 
assign a coordinate to each point of this edge (the leftmost end of an edge has coordinate 
0, the rightmost one has coordinate 1). Let n be an (x, x 2 )-cell of A. Let us map top(7r) 
onto bot(7r) linearly, that it, the point on the edge top(7r) with coordinate t € [0,1] is 
taken to the point on bot(7r) with coordinate 2 1 (the bottom path of n has length 2 so it 
is naturally homeomorphic to [0,2]). The same thing can be done for an (x 2 ,x)-cell of A. 
Thus for any cell tt of A we have a natural mapping T n from top(7r) onto bot(7r) (we call 
it a transition map). Now let t be any number in [0,p]. We consider the point o on top(A) 
that has coordinate t. If o is not a point of bot(A), then it is an internal point on the top 
path of some cell. Thus we can apply the corresponding transition map to o. We repeat 
this operation until we get a point o' on the path bot(A). The coordinate of this point is a 
number in [0, q\. Hence we have a function /a: [0,p] —> [0, q] induced by A. It is easy to see 
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this will be a piecewise-linear function. When we concatenate diagrams, this corresponds to 
the composition of the PL functions induced by these diagrams. For the group F, which is 
the diagram group T>(V,x), we have the homeomorphism from it to PLF[ 0,1]. It is known 
this is an isomorphism. 

Now let us recall that F has another representation by PL functions. Namely, let 
ff [0, oo) —> [0, cx)) (i > 0 is an integer) be the PL function that has slope 1 on [0, i], 
slope 2 on [i, i + 1] and slope 1 again on [i + 1, oo). It is easy to verify that for any integers 
j > i > 0, one has f ] f l = fifj+i (the functions act on the right). Thus the mapping x t i—>• f t 
induces a homomorphism from F to PLF[ 0, oo). It is also known this is an isomorphism. 

In the above representation of F by diagrams, all these diagrams were spherical , that is, 
they were (w, ^-diagrams for some word w. Our new representation of F by non-spherical 
diagrams, which we are going to describe, corresponds to the representation of F by the PL 
functions on [0, oo). Let A be any diagram over V = ( x \ x 2 = x ), not necessarily spherical. 
Let us add an infinite sequence of edges on the right of A, each edge is labelled by x. This 
object will be called an infinite diagram over V. Note that it has finitely many cells. An 
infinite diagram that corresponds to A will be denoted by A. It has the leftmost vertex t(A) 
and two distinguished infinite paths starting at t(A), both labelled by the infinite power 
of x. These paths will be denoted by top(A) and bot(A), respectively. The concept of 
a dipole in an infinite diagram is defined as above. The same concerns the operations of 
deleting/inserting a dipole, the equivalence relation induced by these operations, and so on. 
Any two infinite diagrams can be naturally concatenated (the bottom path of the first factor 
is identified with the top path of the second factor). We use the sign o for the concatenation. 
The operation o gives the set of all infinite diagrams a monoid structure. The identity of it 
is the infinite diagram without cells denoted by e. As in the case of spherical diagrams, the 
operation of concatenation induces a group operation on the set of all equivalence classes 
of infinite diagrams. Thus we have a group. We shall denote it by T>{V,x). (This makes 
sense for any semigroup presentation V in an alphabet of one letter. Note that we can forget 
about the labels if we work with a one-letter alphabet.) It is quite easy to see that the group 
we have will be isomorphic to F. Indeed, let be the infinite diagram 


x 



that consists of an (x, x 2 )-cell, the finite path labelled by x l on the left of it and the infinite 
path labelled by the infinite power of x, on the right. By Xfi 1 we mean the mirror image of 
X t under the horizontal axis symmetry. Infinite diagrams of the form Xf 1 (i > 0) are called 
atomic. For any integers j > i > 0, the diagram 


x x 
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equals both Xj o X t and X t o Xj +1 . This means that we have a homomorphism from F to 
the group of infinite diagrams. This homomorphism is onto because any infinite diagram 
is a concatenation of atomic diagrams. The homomorphism must be injective because all 
homomorphic images of F are abelian ||. However, the group of infinite diagrams is not 
abelian since X\X Q = X 0 X 2 7 ^ X 0 X t . 

Note that the diagram X, corresponds to the above dehned PL function f,. (The transi¬ 
tion maps allow to assign a PL function to any infinite diagram.) If we consider a dual graph 
to an infinite diagram, this gives us a pair of binary rooted forests. Each forest is a sequence 
of binary rooted trees. All of these trees but the finite number of them consists of just the 
root vertex. All leaves of a forest can be naturally enumerated. Given a pair of such forests, 
we can identify the zth leaf of the “upper” forest with the ith leaf of the “lower” forest for 
alii > 0. 

When we work with infinite diagrams, it is convenient to eliminate the infinite “tail” on 
the right of each infinite diagram. An ordinary diagram over V is called canonical whenever 
it has no dipoles and it is not a sum of a diagram and an edge. It is obvious that there is a 
one-to-one correspondence between the set of infinite diagrams without dipoles and the set 
of canonical diagrams. So we may assume that each element of F has a unique canonical 
representative. We have a group structure on the set of all canonical diagrams over V . Given 
an (x p , aP)-diagram Ai and an (x s , a^j-diagram A 2 , we multiply them as follows. If q = s, 
then we concatenate them. If q < s, then we concatenate Ai + e(x s ~ g ) and A 2 . If q > s, 
then we concatenate Ai and A 2 + e(x q ~ s ). After the concatenation, we reduce all dipoles 
in the result. Then we need to make the diagram canonical. This means that we have to 
delete the rightmost common suffix of the top and the bottom path of the diagram we have. 
The only exception is made for the diagram e(x), the identity element of the group. This 
diagram is already canonical so we leave it as it is. 

This representation of F by canonical diagrams is very convenient for many reasons. Say, 
the number of cells in the canonical diagram of an element always equals the length of the 
normal form of this element. Given a normal form, it is very easy to draw the corresponding 
diagram, and vice versa. The following example illustrates the diagram that corresponds to 
the element g = XqX\XqX^ 1 Xq ‘ 2 represented by its normal form: 



This representation of elements in F by canonical diagrams has some more advantages. 
First of all, the number of cells in a canonical diagram of an element g is obviously the 
length of the normal form of g, which is also the shortest length of a word in the infinite 
set of generators { Xq 1 , xf 1 , xf 1 ,...} that represents g. Given the canonical representative 
A of G, it is easy to construct an (x, x)-diagram that represents the same element. One has 
to add an edge on the right of A, that is, to consider the diagram A + e(x). Let v be the 
rightmost vertex of this diagram. We then connect v by arcs to all the vertices in top(A) 


17 



in the upper part and connect v by arcs to all the vertices of A in the lower part. All edges 
we add have label x. As a result, we give an (x, x)-diagram that represents g in the diagram 
group T>(V,x). This diagram has “redundant” cells — they are all cells that are not in A. 
These cells do not correspond to any generators so A looks a more “natural” representative 
of g. 

If we prefer to work with dual graphs, then, given a pair of binary rooted trees from || 
that represent an element g , one has to remove all right carets from it. The rest will be the 
pair of forests that is exactly the dual graph to the canonical representative of g. 


^-cells 


Note that the diagram in the above picture is drawn in such a way that all the ( x , x 
of it are above the straight line and all the (x 2 , x )-cells are below the straight line. This is 
an easy consequence of the fact that the normal form of an element in A is a product of the 
form pq _1 , where p, q are positive words (words in generators Xq, aq, oq, • • • with no negative 
exponents). The following elementary fact was essentially used several times in [[ 
some other papers. 


16 and 


Lemma 3 Let V — (X \ 7Z) be a semigroup presentation. Suppose that all defining 
relations of V have the form a = A, where a E X, A is a word of length at least 2. Also 
assume that all letters in the left-hand sides of the defining relations are different. Then any 
irreducible diagram A over V is the concatenation of the form Ax o Af 1 , where the top path 
of each cell of both A 1; A 2 has length 1. The longest positive path in A from if A) to r( A) 
is the bottom path of both Ai and A 2 . 

Note that ( x \ x = x 2 ) obviously satisfy the conditions of the Lemma. The same concerns 
the presentation (a, b \ a — bab, b = aba ), which was considered in the previous Section. Let 
us recall the idea of the proof. Let p be the longest positive path in A from i(A) to t(A). 
It cuts A into two parts. It suffices to prove that all cells in the “upper” part correspond to 
the defining relations of the form a = A, where a is a letter, and none of them correspond 
to A = a. Assume the contrary. Suppose that there is a cell n in the upper part of A with 
the top label A and the bottom label a. The bottom path of n cannot be a subpath in p 
since p is chosen the longest. So the bottom edge of 7r belongs to the top path of some cell 
7r'. The diagram A has no dipoles. All letters in the left-hand sides of the defining relations 
are different. So the top path of n' cannot have length 1. This means that we have found 
a new cell in the upper part of A that also corresponds to the defining relation of the form 
A = a. Applying the same argument to 7r', we get a process that never terminates. This is 
impossible since the cells that appear during the process cannot repeat. This completes the 
proof. 


4 The Length Formula 


An algorithm to fold the norm of an element of F in generators xo, x± was obtained by 
Fordham Jl2| in his PhD. The algorithm is based on the representation of F by pairs of 
binary rooted trees. Each caret of a tree belongs to one of 7 classes defined in [|12j (only 6 of 
these classes are essential). All carets in each of the trees are enumerated. For any number, 
there are 36 cases for a pair formed by the ith carets of the trees, to belong to some of the 
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classes. To each of these 36 cases one can assign a weight. The sum of these numbers will 
be the norm of an element. By this algorithm, the norm can be found in a polynomial time, 
although the algorithm itself is quite complicated. However, it is a powerful tool to solve 
various problems about F. For instance, Fordham’s algorithm is successfully implemented 
in recent papers || [8] . In this Section, we present another algorithm to find the norm of an 
element in F. We are based on the representation of F by non-spherical diagrams described 
in Section |3j. We classify vertices of the canonical diagram of a given element. There are 
only two cases for this. To find the norm, one has to add the number of cells in the diagram 
and the number of so-called special vertices multiplied by 2. This algorithm is very clear 
and easy in use. 

Let A be a diagram over V = ( x \ x 2 = x ). From Lemma ^ we know that all vertices of 
A belong to the longest positive path p from t(A) to r(A). We need to classify vertices of A. 
A non-empty positive path q in A (in particular, a single edge) is called a brigde whenever 
q is a common subpath in both top(A) and bot(A). If q is a bridge of A, then one has 
a decomposition of the form A' + q + A", where A, A" are some diagrams (possibly A" is 
empty) and q denotes the subdiagram in A that consists of the bridge q. We say that the 
bridge q is nontrivial whenever A " has cells. Note that any bridge q, being a path, has its 
initial point i(q). 

A vertex v in A is called active if v is either an initial point of a cell in A or v is an initial 
point of a nontrivial bridge of A. Since A is a finite graph, we have the distance function 
on vertices of A. The distance between two vertices is the length of the shortest path in A 
that connects these vertices. An active vertex in A is called special if its distance from the 
origin O = t(A) is strictly greater than 1. 

In the following picture 



we enumerate all vertices of the diagram A travelling along p from the origin and starting 
from the number 0. It is clear that vertices 0, 1, 3, 5, 6 are active. Vertices 5, 6 from this 
list are the only special vertices of A. 

Theorem 4 Let g £ F be represented by the canonical diagram A. Then the norm of 
g, that is, the length of the shortest word representing g in group generators xq, x\ , can be 
found by the following formula: 

\\g\\ = # c A + 2#,A, (13) 

where ff c denotes the number of cells and ff s denotes the number of special vertices of a 
diagram. 
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In the above example, one easily has # C A = 7, = 2 so ||g|| = \\xIxiXq,x^ 1 Xq 2 \ \ = 

7 + 2-2 = 11. Recall that the number of cells in the canonical diagram of g is always equal 
to the length of the normal form of g. Before the proof, we would like to verify that g is an 
example of a dead vertex in the Cayley graph of F. Namely, we would like to show that for 
any a = x^ 1 (i = 0,1) one has | \ga\ | < | \g\ \ (that is, the norm of g decreases in all directions). 
It can be shown that the number 11 is the smallest one for which this effect is possible. 

We need to analyze what happens if we multiply g by each of the four semigroup gener¬ 
ators. Let us consider 4 cases. 

1) g —> gx 0 . In this transition, when we multiply g by Xq, one cell in A cancels (the 
leftmost cell on the bottom). The set of vertices remains the same. It is also clear that 
the set of active vertices will be the same and no vertices change their distance from the 
origin. So decreases by 1, is the same. Hence the norm decreases by 1, that is, 
lls^oll = \\g\\ - 1 = 10. 

2) g —> gx i- When we multiply g on the right by aq, we cancel the second cell on the 
bottom. This means we just remove the arc that connect vertices 3 and 5. In this case the 
vertex number 3 will be no longer active. But the vertex number 4 becomes active because 
it will be the initial point of a bridge. However, the vertex number 4 is on the distance 1 
from the origin so it is not special. Thus the only special vertices in the new diagram will 
be 5 and 6, as before. This means that the norm decreases by 1 so ||yxi|| = | \g\\ — 1 = 10. 

3) g —■► gx^ 1 . Multiplying by Xq 1 on the right means that we add a new cell to the 
diagram. This means we add an arc that connects vertices 0 and 5. The set of active vertices 
remains the same. But now the vertex number 5 will be on the distance 1 from the origin so 
the only special vertex in the new diagram will be the vertex number 6. Thus increases 
by 1, decreases by 1 and so the norm decreases by 1, that is, \\gxQ || = | \g\\ — 1 = 10. 

4) g —> gxi 1 . In this case we need to add a new cell connecting vertices 3 and 6 by an 
arc. Now the vertex number 5 will not be an initial point of a bridge. The only special 
vertex in the new diagram will be 6. As in the previous paragraph, increases by 1, 
decreases by 1 and 1111 = | \g\ \ — 1 = 10. 

Proof of Theorem |Ij. We use the idea of [T^ . Lemma 2.1.1], For any g G F we consider 
the canonical non-spherical diagram A that represents g. The value # C A + 2# S A will be 
denoted by ip(g ). Our aim is to show that ||g|| = ip(g) for any g G F. 

Clearly, <p(g) = 0 if and only if g = 1 in F. We show that <p(ga) = ip(g) ± 1 for any g G F 
and for any semigroup generator a G This will imply that cp(g) < ||g|| for any 

g G F. Indeed, if we decompose g into a product of n = ||g|| generators, then one can easily 
show by induction that <p(g) < n. 

Let us consider two neighbour elements in the Cayley graph C^- For these elements we 
can compare the number of cells in canonical diagrams that represent these elements. It is 
obvious that the difference between the numbers of cells will be always 1 or —1 so we can 
assume without loss of generality that the number of cells in the canonical diagram of ga 
will be greater than the number of cells in the canonical diagram A of g by 1. 

We consider 4 cases. In each of them we need to show that the number of special vertices 
either remains the same or it decreases by 1. This will imply that the value of ip always 
increases by 1 or decreases by 1 after we multiply its argument by a generator. Recall that 


20 


X, is the diagram that corresponds to the generator ay G F {i > 0). 

Case 1. a = Xq. Since multiplying by X 0 increases the number of cells, we know that 
A has no cells whose bottom path coincides with the leftmost edge of bot(A). When we 
concatenate A and X 0 , then a new vertex appears on the bottom path of the cell of Xq. 
This vertex is on the distance 1 from the origin. So the vertex we add is not special. It is 
easy to see that no new special vertices can appear. So p increases by 1 in this case. 

Case 2. a — x\. In this case we concatenate A and X t . This gives a new vertex (on the 
bottom of the cell of Xi). Clearly, this vertex is not special since it is not an initial point of 
a cell and it is not the initial point of a bridge. The status of all other vertices will be the 
same, that is, no special vertices appear or disappear. Thus p increases by 1. 

Case 3. a = xf 1 . Let v 0 ,vi,v 2 , ... be all vertices of the bottom path of A enumerated 
from the left to the right. Multiplying by Xf 1 means that we connect vertices Vq and v 2 by 
an arc. Suppose that v 2 was a special vertex in A. In this case the number of special vertices 
in AoA^ 1 will be less by 1 than the number of special vertices in A. Otherwise the number 
of special vertices remains the same. Hence p decreases or increases by 1, respectively. 

Case 4. a = xf 1 . In the notation of the previous paragraph, one has to add an arc that 
connects vertices and v 3 in order to concatenate A and Xf k Note that vertices vq, V\ are 
not special in our diagrams. The vertex v 3 also does not change its status because the new 
arc does not connect it with the origin. Clearly, nothing happens with vertices Vk if k > 4. 
So the only vertex we need to think about is v 2 . If the distance from it to the origin in A is 
1, then the same is true for A' = Ao X-f 1 . Suppose that the distance in A from v 2 to the 
origin is at least 2. Then the same is true for A'. If v 2 was not active in A, then it could not 
be active in A'. Suppose that v 2 is an initial point of a cell in A. In this case v 2 be a special 
vertex of A. It is obvious that v 2 has the same status in A'. So the only case we need to 
check is when v 2 is an initial point of a nontrivial bridge in A. After we concatenate A and 
Xf 1 , this will no longer be true. So v 2 will not be special in A' so the number of special 
vertices decreases by 1. As a result, we have that p increases or decreases by 1. 

To complete the proof, we need to check that for any j G F, j / 1, there exists at least 
one generator a e {xq\ xf 1 } such that p decreases at its direction, that is, p(ga ) = p(g) — 1. 
This will imply the inequality ||g|| < p(g) for any g G F. Indeed, if p(g) = n, then we can 
start from the vertex g in C 2 . At each step we choose a direction that decreases p by 1. 
After n steps, we get to the element with the zero value of p, that is, to the identity. So the 
distance from g to the origin (that is, the norm of g) does not exceed n. 

We enumerate the vertices of bot(A) as above, where A is the canonical representative 
of g. Since g f 1, the diagram A is nontrivial. Suppose that v 2 is a special vertex in A. 
Then A cannot be the concatenation of some diagram A' and X 0 (otherwise v 2 is on the 
distance 1 from the origin). So if we multiply A by Xf 1 , then the number of cells increases 
by 1 and we are in the situation of Case 3. Since v 2 is special vertex of A, the value of p 
decreases by 1, that is, p{gxQ 1 ) = p(g) — 1. 

Assume that v 2 is not a special vertex of A. Let A be a concatenation of some diagram 
A' and Xf 1 . If we multiply g by X\ on the right, then the number of cells decreases and 
we are in the situation, which is inverse to the one described in Case 4 (now A is obtained 
from A' by adding a cell when multiplying by xf 1 ). We know that p{A') is always greater 
by 1 than p( A) except for one case. Let us describe it. By 7r we denote the cell in A that 


21 



corresponds to the factor X{ 1 in the concatenation A'oXf 1 . The top path of 7r is a product 
of two edges e'e". The terminal point of e! (=the initial point of e") will be denoted by v. 
According to Case 4 (note that we have to exchange the roles of A and A') we have that v 
must be the initial point of a nontrivial bridge in A'. This bridge starts with the edge e". 
So the terminal point of e" must be a special vertex of A' and also of A. But this vertex is 
v 2 , which gives a contradiction. 

Let us analyze all cases that can happen to the edge e that is contained in bot(A) (or 
its continuation) and has the endpoints V\, v 2 . Suppose that e does not belong to a cell so 
it is a bridge. This bridge should be trivial since v 2 is not special. Thus A is in fact an 
(x m , x)-diagram for some m > 1. It is nontrivial so A is a concatenation of some diagram 
A' and Xq 1 . If we look at the situation of Case 3 (in our case we go from A' to A adding 
a cell), then we see that <p(X) will be always greater than </?(A') by 1 except for one case 
when the vertex v\ is a special vertex of the subdiagram A'. This cannot happen because 
bot(A) has length 1. Thus (p(gx o) = p>(g) — 1 in this case. 

Now let e be contained in some cell of A. The only case we need to consider is the one 
when e is a part of the bottom path of an (x, x 2 )-ccll (the case of an (x 2 , x)-cell means that 
A is a concatenation of some diagram and Xf 1 , which has been already considered). Let us 
denote this ( x , x 2 )-cell by tt. The edge e may be the first or the second edge of bot(7r) so we 
have two subcases. 

Suppose that bot(7r) = e'e for some edge e'. If e' is contained in the bottom path of A, 
then it coincides with the first edge of bot(A). Thus A will be the concatenation of some 
diagram A' and Xq. From Case 1 we can extract that p>(gx q ) = <p(g) — 1 since A is obtained 
from A' by adding one cell that corresponds to Xo- If e! is not contained in bot(A), then 
the first edge of bot(A) must be a bottom edge of an (x 2 , x)-cell. This means that A is a 
concatenation of some diagram A' and X ( J ] . The situation we have is described in Case 3. 
Adding the new cell to A' that corresponds to X$ l always increases ip except for the case 
when the vertex V\ will be a special vertex of the subdiagram A'. But we know that the only 
positive edge of A that comes out of V\ is e. So V\ is not an initial point of a cell. Neither e 
is a bridge. So ip(gx 0 ) = tp(g) — 1 in this subcase. 

Finally, let bot(7r) = ee" for some edge e". If e" is also contained in bot(A), then 
A is a concatenation of some diagram and X\. From Case 2 we can easily conclude that 
ip(gx b 1 ) = <p(g) — 1. If e" is not contained in bot(A'), then there is an “angle” between 
e" and the third edge of bot(A). Then v 2 will be an initial point of some cell. However, 
v 2 cannot be on the distance 1 from the origin because otherwise the arc that connects the 
origin and v 2 would cross the edge top(7r), which is impossible. Therefore, v 2 is special. We 
have a contradiction. 

The proof is complete. Note that we have a quick procedure to find minimal representa¬ 
tives of elements of F using the above description. 
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